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ABSTRACT 

We present measurements of the redshift-space luminosity-weighted or ‘marked’ corre¬ 
lation function in the SDSS. These are compared with a model in which the luminosity 
function and luminosity dependence of clustering are the same as that observed, and 
in which the form of the luminosity-weighted correlation function is entirely a con¬ 
sequence of the fact that massive halos populate dense regions. We do this by using 
mock catalogs which are constrained to reproduce the observed luminosity function 
and the luminosity dependence of clustering, as well as by using the language of the 
redshift-space halo-model. These analyses show that marked correlations may show a 
signal on large scales even if there are no large-scale physical effects—the statistical 
correlation between halos and their environment will produce a measureable signal. 
Our model is in good agreement with the measurements, indicating that the halo mass 
function in dense regions is top-heavy; the correlation between halo mass and large 
scale environment is the primary driver for correlations between galaxy properties 
and environment; and the luminosity of the central galaxy in a halo is different from 
(in general, brighter than) that of the other objects in the halo. Thus our measure¬ 
ment provides strong evidence for the accuracy of these three standard assumptions of 
galaxy formation models. These assumptions also form the basis of current halo-model 
based interpretations of galaxy clustering. 

When the same galaxies are weighted by their u—, g—, or r—band luminosities, 
then the marked correlation function is stronger in the redder bands. When the weight 
is galaxy color rather than luminosity, then the data suggest that close pairs of galaxies 
tend to have redder colors. This wavelength dependence of marked correlations is 
in qualitative agreement with galaxy formation models, and reflects the fact that 
the mean luminosity of galaxies in a halo depends more strongly on halo mass in 
the r—band than in u. The m— band luminosity is a tracer of star formation, so our 
measurement suggests that the correlation between star formation rate and halo mass 
is not monotonic. In particular, the luminosity and color dependence we find are 
consistent with models in which the galaxy population in clusters is more massive and 
has a lower star formation rate than does the population in the field. The virtue of 
this measurement of environmental trends is that it does not require classification of 
galaxies into held, group and cluster environments. 

Key words: methods: analytical - galaxies: formation - galaxies: haloes - dark matter 
- large scale structure of the universe 


1 INTRODUCTION 

In hierarchical models of structure formation, there is a 
correlation between halo formation and abundances and 
the surrounding large scale structure—the mass function in 
dense regions is top-heavy (Mo & White 1996; Sheth & Tor- 

* E-mail: ramin@phyast.pitt.edu (RS); 

shethrk@physics.upenn.edu (RKS); ajc@phyast.pitt.edu (AJC) 
scranton@bruno.phyast.pitt.edu (RS) 


men 2002). Galaxy formation models assume that the prop¬ 
erties of a galaxy are determined entirely by the mass and 
formation history of the dark matter halo within which it 
formed. Thus, the correlation between halo properties and 
environment induces a correlation between galaxy proper¬ 
ties and environment. The main goal of the present work 
is to test if this statistical correlation accounts for most of 
the observed trends between luminosity and environment 
(luminous galaxies are more strongly clustered), or if other 
physical effects also matter. 
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We do so by using the statistics of marked correlation 
functions (Stoyan & Stoyan 1994; Beisbart & Kerscher 2002) 
which have been shown to provide sensitive probes of envi¬ 
ronmental effects (Sheth & Tormen 2004; Sheth, Connolly 
& Skibba 2005). The halo model (see Cooray & Sheth 2002 
for a review) is the language currently used to interpret 
measurements of galaxy clustering. Sheth (2005) develops 
the formalism for including marked correlations in the halo 
model of clustering, and Skibba & Sheth (2005) extend this 
to describe measurements made in redshift space. This halo 
model provides an analytic description of marked statistics 
when correlations with environment arise entirely because 
of the statistical effect. 

Sectionl^describes how to construct a mock galaxy cat¬ 
alog in which the luminosity function and the luminosity 
dependence of clustering are the same as those observed in 
the Sloan Digital Sky Survey. In these mock catalogs, any 
correlation with environment is entirely due to the statisti¬ 
cal effect. Sectionl^shows that the halo model description of 
marked statistics provides a good description of this effect, 
both in real and in redshift space. Section 2] compares mea¬ 
surements of marked statistics in the SDSS with the halo 
model prediction. The comparison provides a test of the as¬ 
sumption that correlations with environment arise entirely 
because of the statistical effect. A final section summarizes 
our results, and shows that marked statistics provide inter¬ 
esting information about the correlation between galaxies 
and their environments without having to separate the pop¬ 
ulation into the two traditional extremes of ‘cluster’ and 
‘field’. 


2 WEIGHTED OR MARKED CORRELATIONS 
IN THE ‘STANDARD’ MODEL 


Zehavi et al. (2005) have measured the luminosity depen¬ 
dence of clustering in the SDSS (York et al. 2000; Adelman- 
McCarthy et al. 2005). They interpret their measurements 
using the language of the halo model (see Cooray & Sheth 
2002 for a review). In particular, they describe how the 
distribution of galaxies depends on halo mass in a ACDM 
model with (flo, h, frs) = (0.3, 0.7, 0.9) which is spatially flat. 
In this description, only sufficiently massive halos (Mhaio > 
10 ^^Mq) host galaxies. Each sufficiently massive halo hosts 
a galaxy at its centre, and may host satellite galaxies. 
The number of satellites follows a Poisson distribution with 
a mean value which increases with halo mass (following 
Kravtsov et al. 2004). In particular, Zehavi et al. report that 
the mean number of galaxies with luminosity greater than 
L in halos of mass M is 


Ag,i(> L\M) = 1 -b iVsat(> L\M) = 1 -b 


M 

Mi{L) 


a(L) 


( 1 ) 


if M > Minin(L), and Agai(M) = 0 otherwise. In practice, 
Afmin(L) is a monotonic function of L\ we have found that 
their results are quite well approximated by 

(lOi^h-iMo) IO8/1-2L0) 

Mi{L) « 23Minin(L), and a ~ 1. 

Later in this paper we will also study a parametrization 


in which the cutoff at Mmin is less abrupt: 
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This is motivated by the fact that semi-analytic galaxy for¬ 
mation models show smoother cut-offs at low-masses (Sheth 
& Diaferio 2001; Zheng et al. 2005), and that parameteri- 
zations like this one can also provide good fits to the SDSS 
measurements (Zehavi et al. 2005). 

We use the model in equation 0 to populate halos 
in the 2 = 0.13 outputs of the VLS ACDM simulation 
(Yoshida, Sheth & Diaferio 2001) as follows. We specify a 
minimum luminosity Lmin which is smaller than the mini¬ 
mum luminosity we wish to study. We then select the subset 
of halos in the simulations which have M > Mmin (Lmin). 
We specify the number of satellites each such halo con¬ 
tains by choosing an integer from a Poisson distribution 
with mean Asat(> Lmin|M). We then specify the luminosity 
of each satellite galaxy by generating a random number u 
distributed uniformly between 0 and 1, and finding that L 
for which Nsa.t{> L|M)/Asat(> Lmin|M) = u. This ensures 
that the satellites have the correct luminosity distribution. 
Finally, we distribute the satellites around the halo centre so 
that they follow an NEW profile (see Scoccimarro & Sheth 
2002 for details). We also place a central galaxy at the cen¬ 
tre of each halo. The luminosity of this central galaxy is 
given by inverting the Mmin(L) relation between minimum 
mass and luminosity. We assign redshift space coordinates 
to the mock galaxies by assuming that a galaxy’s velocity 
is given by the sum of the velocity of its parent halo plus a 
virial motion contribution which is drawn from a Maxwell- 
Boltzmann distribution with dispersion which depends on 
halo mass (following equation 1121 below). We insure that 
the centre of mass motion of all the satellites in a halo is 
the same as that of the halo itself by subtracting the mean 
virial motion vector of satellites from the virial motion of 
each satellite (see Sheth & Diaferio 2001 for tests which in¬ 
dicate that this model is accurate). 

The resulting mock galaxy catalog has been constructed 
to have the correct luminosity function (Figure0 as well as 
the correct luminosity dependence of the galaxy two-point 
correlation function. In addition, note that the number of 
galaxies in a halo, the spatial distribution of galaxies within 
a halo, and the assignment of luminosities all depend only on 
halo mass, and not on the surrounding large-scale structure. 
Therefore, the mock catalog includes only those environmen¬ 
tal effects which arise from the environmental dependence 
of halo abundances. 

For reasons described by Sheth, Connolly & Skibba 
(2005), the marked correlation function we measure in the 
mock catalogs is 


M(s) 


1-k W(s) 

i+e(s) ’ 


( 4 ) 


where ^(s) is the two-point correlation function in redshift 
space, and W (s) is the same sum over galaxy pairs separated 
in redshift space by s, but now each member of the pair is 
weighted by the ratio of its luminosity to the mean luminos¬ 
ity of all the galaxies in the mock catalog. (Schematically, 
if the estimator for 1 -|- ^ is DD/RR, then the estimator 
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Figure 1. Luminosity function in the mock catalog (symbols with 
error bars); M refers to the absolute magnitude in the r—band. 
Smooth curve shows the SDSS luminosity function (Blanton et 
al. 2003). 


for 1 + VK is WW/RR, so the estimator we use for M is 
WW/DD.) This measurement of M(s) represents the pre¬ 
diction of the ‘standard’ model: the shape of the luminosity 
weighted correlation function includes the effects of the sta¬ 
tistical correlation between halo mass and environment, but 
no other physical effects. 


3 THE HALO MODEL DESCRIPTION 


This section shows how to describe the marked correlation 
in redshift space discussed above in the language of the halo 
model. Details are in Skibba & Sheth (2005); in essence, the 
calculation combines the results of Sheth (2005) with those 
of Seljak (2001). 

In the halo model, all mass is bound up in dark matter 
halos which have a range of masses. Hence, the density of 
galaxies is 


^gal — 



dn{m) 

dm 


iVgai(m), 


(5) 


where dn(m)/dm denotes the number density of haloes 
of mass m. The redshift space correlation function is the 
Fourier transform of the redshift space power spectrum 
P{k)-. 



dk k^Pik) sinfcs 
k 2n'^ ks 
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In the halo model, P{k) is written as the sum of two terms: 
one that arises from particles within the same halo and dom¬ 
inates on small scales (the 1-halo term), and the other from 
particles in different haloes which dominates on larger scales 
(the 2-halo term). Namely, 


P{k) ^ Pih{k) + P2h{k), 


where, in redshift space. 
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u{k\m) is the Fourier transform of the halo density prohle 
divided by the mass m, H is the Hubble constant, and 


rim) 
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2rvir 


= G 
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( 12 ) 


is the line-of-sight velocity dispersion within a halo (Avir ~ 
200). In addition, the bias factor h(m) describes the strength 
of halo clustering, 

Ev = / — ui(fc|m)&(m), (13) 

J am p 

= f^^^l + NMm)u,ik\m) 

J dm hgai 

/ = dlnZ)(a)/dlna ~ and Punik) is the power spec¬ 

trum of the mass in linear theory. The real space power 
spectrum is given by setting the terms in square brackets in 
equations itTUl and lITTt for Ml and U 2 to unity, and —> 0. 
When explicit calculations are made, we assume that the 
density profiles of halos have the form described by Navarro 
et al. (1996), so u has the form given by Scoccimarro et 
al. (2001), and that halo abundances and clustering are de¬ 
scribed by the parameterization of Sheth & Tormen (1999). 

To describe the effect of weighting each galaxy by its lu¬ 
minosity, let W (r) denote the weighted correlation function, 
and W{k) its Fourier transform. Following Sheth, Abbas & 
Skibba (2004) and Sheth (2005), we write this as the sum of 
two terms: 


where 
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and 


L 



dn{m) 

dm 


LcsYiimt) “1- Nsat{m'j Lmin) 

"^gal 


(17) 


Here L is the average luminosity, Lcen{m) is the luminosity 
of the galaxy at the centre of an m-halo, and (L|m, Lmin) 
is the average luminosity of satellite galaxies more luminous 
than Lmin in m-haloes. Thus, the calculation requires an 
estimate of how the central and the average satellite lumi¬ 
nosity depend on m. As we show below, both are given by 
the luminosity dependence of ^ (i.e., equation 0, so this 
halo model calculation of the weighted correlation function 
requires no additional information! 

The luminosity of the central galaxy is obtained by in¬ 
verting the relation between Mmin and L (e.g., equation I^J. 
Obtaining an expression for the average luminosity of a 
satellite galaxy is more complicated. Define 


H(^ 1771, Tmin) 


Naat{> L\m) 

fVsat(> L min I TTl) 

POO 

J dLp{L\m,Lniin), 


(18) 


where Nsat is given by equation Q. Then the mean lumi¬ 
nosity of satellites in m halos. 


(T|777, min) — 
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can be obtained from the fact that 



dL' P{> L'\m, Lmin) 


(Z/|?77, Lmin) L 


min • 


( 20 ) 


This shows that if we add Lmin to the quantity on the left 
hand side (which is given by integrating equation 0 over L), 
we will obtain the quantity we are after. 

Incidentally, since both Lcen(m) and {L|m, Lmin) can 
be estimated from the SDSS fits, the mean luminosity of 
the galaxies in an m-halo. 


Lav (m, Lmin) 


Lcen (m) -b A'sat (m, L min ){L\rn,L 

min) 

1 “t“ A^sat(m, Lmin) 


( 21 ) 

is completely determined by equation 0 . The mass-to-light 
ratio of an m halo is m/[Agai(m) Lav(m)]: this shows ex¬ 
plicitly that the luminosity dependence of the galaxy corre¬ 
lation function constrains how the halo mass-to-light ratio 
must depend on halo mass. This halo-mass dependence has 
been used by Tinker et al. (2005); our analysis provides an 
analytic calculation of the effect. It shows that, in low mass 
halos. Lav ~ Lcen becausc Nsat <C 1, whereas in massive 
halos. Lav < Lcen. Figurecompares the mass dependence 
of Lav, Lcen, and Lsat for galaxies restricted to Mr < —20.5 
as predicted by Zehavi et al.’s (2005) halo model interpreta¬ 
tion of the luminosity dependence of clustering in the SDSS. 
The symbols show measurements from our mock catalogs. 
The different quantities scale very differently with halo mass, 
with the following consequence. 

Equation iPll treats the central galaxies differently 
from the others. If the luminosities of the central galax¬ 
ies were not special, then the contribution to the one-halo 
term would scale as A^satLav for the centre-satellite term, 
and (AsatLav)^ for the satellite-satellite term. Note that, in 
this case, the luminosity weights are the same for the two 
types of terms—only the number weighting differs. However, 



1012 1013 1014 1015 
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Figure 2. Mean number of galaxies in a halo (top) and mean lu¬ 
minosity in a halo (bottom) for SDSS galaxies with Mr < —20.5, 
as a function of the masses of their parent halos, predicted by 
the luminosity dependence of clustering. Different curves in bot¬ 
tom panel show the mean luminosity of the galaxies in a halo, 
the luminosity of the central galaxy, and the mean luminosity of 
the others, as a function of halo mass (solid, dashed, and dotted 
curves). Symbols show the result of computing these relations in 
our mock catalogs. 


because the mass dependence of Lav is so different from that 
of the other two terms (cf. Figure0, marked statistics allow 
one to discriminate between models which treat the central 
object as special from models which do not (e.g. Sheth 2005). 

To illustrate, the symbols in the top and bottom panels 
of FigureOshow measurements of ^(r) and M{r) measured 
in this mock catalog. Error bars were obtained with a “jack¬ 
knife” procedure, as detailed in Scranton et al. (2002), in 
which the statistic is re-measured after omitting a random 
region, and repeating thirty times (~1.5 times the number 
of bins in separation for which we present results). Note 
that the errors in W are strongly correlated with those in 
5, so that the true error in M is grossly (more than a factor 
of ten) overestimated if one simply sums these individual 
errors in quadrature. A much better approximation of the 
uncertainties is obtained as follows. Randomly scramble the 
marks among the galaxies, remeasure M, and repeat many 
(~ 100) times. Compute the mean of M over these realiza¬ 
tions. This mean, and the rms scatter around it are shown 
as dotted lines in the two panels. Note that this scatter is 
within a factor of two of the full jackknife error estimate; it is 
smaller than the jackknife estimate on scales r > 1/7“ ^Mpc 
and s > 3 / 7 “'^Mpc, and, on smaller scales, it is larger than 
the jackknife estimate. 

The solid lines in the top panel show the halo model 
calculation of These show that the model is in excellent 
agreement with the measurements on all scales in real space. 
The solid and dashed curves in the bottom panel show the 
associated halo model calculations of the marked statistic 
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r (Mpc/h) 

Figure 3. Luminosity-weighted real-space correlation function 
measured in a mock catalogs which resembles an SDSS volume 
limited sample with < —20.5. Symbols show the measure¬ 
ments; smooth curves show the associated halo-model predictions 
when the luminosity of the central galaxy in a halo is assumed 
to be different from the others; dashed curves show the predic¬ 
tion when the central object is not special. Dotted curves show 
the mean and rms values of the statistic M, obtained by after 
randomizing the marks and remeasuring M one hundred times. 


M when central galaxies are special (solid), and when they 
are not (dashed). Note that both these curves give the same 
prediction for the unweighted statistic 

Comparison of these curves with the measurements 
yields two important pieces of information. First, on large 
scales (r > 4/i“^Mpc), the solid and dashed curves are iden¬ 
tical, and they are in excellent agreement with the measure¬ 
ments. This indicates the large-scale signal is well described 
by a model in which there are no additional correlations with 
environment other than those which arise from the correla¬ 
tion between halo mass and environment. This is not reas¬ 
suring, since the mock catalogs were constructed to have no 
correlations other than those which are due to halo bias. 
Second, on smaller scales, the solid curves are in substan¬ 
tially better agreement with the measurements than are the 
dashed curves. (A estimate of the goodness of fit of the 
two marked correlation models yields values which smaller 
by a factor of ten when the central galaxy is treated spe¬ 
cially compared to when it is not.) Since the mock catalogs 
do treat the central galaxies differently from the others, it 
is reassuring that the halo model calculation which incorpo¬ 
rates this difference is indeed in better agreement with the 
measurements. 

In the next section, we will present measurements of 
marked statistics in redshift space. To see if we can use our 
halo model calculation to interpret the measurements, Fig- 
ure0]compares measurements of ^(s) and M(s) in the mock 
catalog with our halo model calculation. The format is sim¬ 
ilar to Figure El solid curves in the bottom panels show the 
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Figure 4. Luminosity-weighted redshift-space correlation func¬ 
tions measured in mock catalogs which resemble an SDSS volume 
limited sample with < —20.5. Symbols show the measure¬ 
ments; smooth curves show the associated halo-model predictions 
when the luminosity of the central galaxy in a halo is assumed 
to be different from the others; dashed curves show the predic¬ 
tion when the central object is not special. Dotted curves show 
the mean and rms values of the statistic M, obtained by after 
randomizing the marks and remeasuring M one hundred times. 
Dot-dashed curves in show the one- and two-halo contributions 
to the statistic in our model when the central object in a halo is 
special. 

predicted marked statistic M when the central galaxy in a 
halo is treated differently from the others, and dashed curves 
show what happens if it is not. Both curves give the same 
prediction for the unweighted statistic 

The top panel shows that the halo model calculation 
of 5(s) is in excellent agreement with the measurements on 
scales larger than a few Mpc, as it was for ^(r). However, it 
is not as accurate when the redshift separations are of order 
a few Mpc. Nevertheless, the model is able to reproduce 
the factor of ten difference between ^(r) and ^(s) on small 
scales. We will discuss the reason for the discrepancy on 
intermediate scales shortly. 

Similarly, the bottom panel shows excellent agreement 
between measurements and model for the marked statistic 
M{s) on large scales (s > 8/i“^Mpc), both when the central 
object is treated specially and when it is not. In addition, 
the model in which the central object is special is in better 
agreement with the measurements on small scales. (A 
estimate of the goodness of fit of the two marked correlation 
models yields values which smaller by more than a factor of 
two when the central galaxy is treated specially compared 
to when it is not.) On intermediate scales, however, there is 
substantial discrepancy discrepancy between the model and 
the mocks; the discrepancy is more pronounced for M{s) 
than for ^(s). 

To study the cause of this discrepancy, dot-dashed lines 
show the two contributions to the statistic, + C) 
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and (1 + 14 ^ 2 / 1 ) 7(1 + C )5 separately. This shows that it is 
on scales where both terms contribute that the model is 
inaccurate. There are two reasons why it is likely that this 
inaccuracy can be traced to our simple treatment of the two- 
halo term. The suppression of power due to virial motions 
means that we must model the two-halo term more accu¬ 
rately in redshift-space than in real-space. Our halo-model 
calculation incorrectly assumes that linear theory is a good 
approximation even on small scales (e.g. Scoccimarro 2004 
shows that this is a dangerous assumption even on scales 
of order 10 Mpc) and that volume exclusion effects (Mo & 
White 1996) are negligible (Sheth & Lemson 1999 discuss 
how one might incorporate such effects). Because our mocks 
make use of both the positions and velocities of the halos in 
the simulations, they incorporate both these effects. Thus, 
our simple halo-model likely underestimates M{s) on inter¬ 
mediate scales, but overestimates it on smaller scales. Since 
this is in the sense of the discrepancy with the measure¬ 
ments in the mock catalogs, it is likely that this inaccuracy 
can be traced to our simple treatment of the two-halo term. 
We will have cause to return to this discrepancy in the next 
section, where we use our halo model calculation to interpret 
measurements of marked statistics in the SDSS dataset. 


4 MEASUREMENTS IN THE SDSS 

Figurel^shows C(’'pi '^) a-nd W{vp, tt), the unweighted (solid) 
and weighted (dashed) correlation functions of pairs with 
separations and tt, perpendicular and parallel to the line 
of sight. The measurements were made in a volume limited 
catalog (59,293 galaxies with Mr < —20.5) extracted from 
the SDSS DR4 database (Adelman-McCarthy et al. 2005). 
Contours show the scales at which the correlation functions 
have values of 0.1, 0.2, 0.5, 1, 2, and 5, when averaged over 
bins of 2 / 1 “'^Mpc in rp and tt. This format, due to Davis & 
Peebles (1983), allows one to isolate redshift space effects 
on the correlation functions, since these act only in the n 
direction. The dotted quarter circles at separations of 5, 10 
and 20h“'^Mpc are drawn to guide the eye—they serve to 
highlight the fact that both ^{rp,TT) and W{rp,-K) are very 
anisotropic. In constrast, the corresponding real-space quan¬ 
tities would be isotropic. The figure shows clearly that W 
has a slightly higher amplitude than 7 on the scales shown. 

The quantity studied in the previous section, for which 
we have analytic (halo-model) estimates, can be derived 
from this plot as follows. Counting pairs in spherical shells 
of radius s = CVp + yields the redshift space correla¬ 
tion function ^(s). This measure of clustering is sensitive 
to the fact that the correlation function in redshift-space is 
anisotropic; in particular, it contains information about the 
typical motions of galaxies within halos (which are respon¬ 
sible for the elongation of the contours along the tt direction 
at Tp < 5h~ Mpc), as well as the motions of the halos them¬ 
selves (which are responsible for the squashing along the tt 
direction at rp > 5h~^ Mpc). The result of counting pairs of 
constant Vp, whatever their value of tt, yields the projected 
correlation function Wp{rp); since Vp is not affected by red¬ 
shift space distortions, this quantity contains no information 
about galaxy or halo motions, so is more closely related to 
the real-space correlation function. Figures]^ and |7| compare 



Figure 5. Unweighted (solid) and weighted (dashed) correlation 
functions measured in volume limited catalog with Mr < —20.5 
in the SDSS. Dotted curves show that the measured correlation 
functions are significantly anisotropic. 

both these quantities with the corresponding halo-model cal¬ 
culations. 

Figure |H] shows ^(s) and M{s) measured in two volume 
limited catalogs extracted from the SDSS database. One of 
these catalogs is the same as that which resulted in Figure^ 
and the other is for a slightly fainter sample (Mr < —19.5, 
with 61,821 galaxies). Error bars are estimated by jack-knife 
resampling, as discussed previously. The solid curves show 
the redshift-space halo-model calculation in which central 
galaxies are special, and dashed curves show the expected 
signal if they are not. (Recall that both have the same C(s)-) 

On large scales, both the solid and dashed curves pro¬ 
vide an excellent description of the measurements on large 
scales. This agreement suggests that correlations with en¬ 
vironment on scales larger than a few Mpc are entirely a 
consequence of the correlation between halo abundances and 
environment, just as they were in the mock catalogs. Since 
the model calculation incorporates the assumption that the 
halo mass function is top-heavy in dense regions, the agree¬ 
ment with the measured M(s) provides strong evidence that 
this is indeed the case. 

The discrepancy between the halo-model calculation 
and the measurements on intermediate scales is similar to 
the discrepancy between the halo-model and the mock cat¬ 
alogs studied in the previous section. There we argued that 
this is almost certainly due to our simple treatment of the 
two-halo contribution to the statistic. Indeed, the marked 
statistics in the mock catalogs behave qualitatively like those 
in the SDSS data (compare Figures |^a,nd|^. suggesting that 
the discrepancy between the halo-model calculation and the 
measurements are due to this, rather than to any environ¬ 
mental effects operating on intermediate scales. 

On small scales, the solid curves are in substantially bet¬ 
ter agreement with the data than are the dashed curves (x^ 
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Figure 6. Redshift-space correlation functions measured in volume limited catalogs with Mr < —19.5 (left) and Mr < —20.5 (right) in 
the SDSS. Top panels show the unweighted correlation function $(s), and bottom panels show the marked statistic M(s). Smooth curves 
show the associated redshift-space halo-model predictions; solid curves are when the central galaxy in a halo is treated differently from 
the others, whereas this is not done for the dashed curves. Dotted curves show the mean and rms values of the statistic M, estimated 
by randomizing the marks and remeasuring M one hundred times. Two sets of curves are shown in the right hand panels; the top set of 
solid and dashed curves shows the halo model calculation in which the relation between the number of galaxies and halo mass is given 
by equation 0: and the bottom set follow from equation 13- 



Figure 7. Projected correlation function measured in volume lim¬ 
ited catalog with Mr < —20.5 in the SDSS. Top panel shows the 
unweighted projected correlation function Wp{rp), and bottom 
panels show the marked statistic Mp{rp). Smooth curves show 
the associated projected halo-model predictions; solid curves are 
when the central galaxy in a halo is treated differently than the 
others, whereas this is not done for the dashed curves. The up¬ 
per set of dashed and solid curves show halo-model calculations 
which follow from equation 0 ; the lower set of curves assume 
equation 0. 


smaller by a factor of four in both plots). Evidently, central 
galaxies are indeed a special population in the data. This 
provides substantial support for the assumption commonly 
made in halo-model interpretations of the galaxy correlation 
function that the central galaxy in a halo is different from 
all the others. 

However, even the solid curves are not in particularly 
good agreement with the measurements. Before attributing 
the discrepancy to environmental effects not included in the 
halo model description, we have explored the effect of mod¬ 
ifying our parametrization of the relation between the num¬ 
ber of galaxies and halo mass which we use (equation 0. 
Figure 0 shows that the parametrization in equation 0, 
with (T = 0.5 and Mi/Mmin = 30, provides equally good fits 
to ^(s), but a slightly better description of M{s). In this 
parameterization of the scaling of Vgai with halo mass, the 
minimum halo mass required to host a galaxy is not a sharp 
step function. 

Further evidence in support of the parametrization in 
which the minimum mass cutoff is not sharp, and in which 
the central galaxy is different from the others is shown in 
Figure0 The top and bottom panels compare measurements 
of the projected correlation functions Wpirp) and Mp{rp), 
where 


Wp{rp) = J dy^{rp,y) = 2 J 

Mp{rp) = i + Wpirp)/rp 


dr ■ 


r^(r) 


’ 

where 


and 


l+Wp{rp)/rp ’ 

Wp{rp) = 2/ dr and r = (22) 

Jrp V’' ^ 

with the associated halo-model calculations. (In the halo 
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model, the real-space quantities ^(r) and W(r) which ap¬ 
pear in the expressions above, are related to ^(s) and VK(s) 
by setting setting _Fv = 0 and taking the limit (Jvir —> 0 in 
Ml and U 2 . See Skibba & Sheth 2006 for our particular defi¬ 
nition of Mp.) Note that these projected quantities are free 
of redshift space distortions, making them somewhat easier 
to interpret. 

As was the case for the redshift space measurements, 
both parameterizations of Agai(iVf) provide good descrip¬ 
tions of the unweighted statistic WpRp), and in both cases, 
the weighted statistic is in better agreement when the cen¬ 
tral object is treated specially. However, the Figure shows 
clearly that when the central object is special, then equa¬ 
tion 0 provides a substantially better description of Mp — 
the agreement with the measurements is excellent over all 
scales. 


5 DISCUSSION 

We showed how to generate a mock galaxy catalog which 
has the same luminosity function (Figure 0 and luminos¬ 
ity dependent two-point correlation function as the SDSS 
data. We used the mock catalog to calculate the luminosity- 
weighted correlation function in a model where all environ¬ 
mental effects are a consequence of the correlation between 
halo mass and environment (Figures 0 and 0 show results 
in real and redshift space). We then showed how to de¬ 
scribe this luminosity-weighted correlation function in the 
language of the redshift-space halo model (equation I15II . 
The analysis showed that estimates of the luminosity de¬ 
pendence of clustering constrain how the mass-to-light ratio 
of halos depends on halo mass feauation 1211 and Figure 0 . 
The central galaxy in a halo is predicted to be substantially 
brighter than the other objects in the halo, and although 
the luminosity of the central object increases rapidly with 
halo mass, the mean luminosity of the other objects in the 
halo is approximately independent of the mass of the host 
halo. 

Our analysis also showed that measurements of clus¬ 
tering as a function of luminosity completely determine the 
simplest halo model description of marked statistics. In addi¬ 
tion, measurements of the marked correlation function allow 
one to discriminate between models which treat the central 
object in a halo as special, from those which do not (Fig- 
ures0and|lj. Also, in hierarchical galaxy formation models, 
the marked correlation function is expected to show a signal 
on large scales if the average mark of the galaxies in a halo 
correlates with halo mass. The signal arises because massive 
halos populate the densest regions; it is present even if there 
are no physical effects which operate to correlate the marks 
over large scales. 

We compared this halo model of marked statistics with 
measurements in the SDSS (Figures 0 and 0. The agree¬ 
ment between the model and the measurements on scales 
smaller than a few Mpc provides strong evidence that cen¬ 
tral galaxies in halos are a special population—in general, 
the central galaxy in a halo is substantially brighter than 
the others. (Berlind et al. 2004 come to qualitatively sim¬ 
ilar conclusions, but from a very different approach.) Sub¬ 
stantially better agreement is found for a model in which 
the minimum halo mass required to host a luminous cen¬ 


tral galaxy does not change abruptly with luminosity. This 
is in qualitative agreement with some semi-analytic galaxy 
formation models, which generally predict some scatter in 
central luminosity at fixed halo mass (e.g. Sheth & Diaferio 
2001; Zheng et al. 2005). 

The agreement between the halo model calculation and 
the data on scales larger than a few Mpc indicates that the 
standard assumption in galaxy formation models, that halo 
mass is the primary driver of correlations between galaxy lu¬ 
minosity and environment, is accurate. In particular, these 
measurements are consistent with a model in which the halo 
mass function in large dense regions is top-heavy, and, on 
these large scales, there are no additional physical or sta¬ 
tistical effects which affect the luminosities of galaxies. In 
this respect, our conclusions are similar to those of Mo et 
al. (2004), Kauffmann et al. (2004), Blanton et al. (2005), 
and Abbas & Sheth (2006), although our methods are very 
different. 

We note in passing that there is a weak statistical ef¬ 
fect for which the halo-model above does not account: at 
fixed mass, haloes in dense regions form earlier (Sheth & 
Tormen 2004). Gao, Springel & White (2005) show that 
this effect is more pronounced for low mass haloes (related 
marked correlation function analyses by Harker et al 2006 
and Wechsler et al. 2006 come to similar conclusions). The 
agreement between our halo-model calculation and the mea¬ 
surements in the SDSS suggests that this correlation be¬ 
tween halo formation and environment is not important for 
the relatively bright galaxy population we have studied here. 
This is presumably because these SDSS galaxies populate 
more massive haloes. Comparisons with larger upcoming 
SDSS datasets, with a fainter luminosity threshold (such as 
Mr < —18), may bear out the correlation between low-mass 
halo formation and environment. 

As a final indication of the information contained in 
measurements of marked statistics, Figure|H]compares M(s) 
when the m— , g— and r—band luminosities are used as the 
mark. The underlying population is the same as that for 
Figures mn the sample is volume limited to Mr < —20.5. 
Thus, ^(s) is fixed, and only W{s) changes with wave-band. 
Notice that there is a clear trend with wavelength: there 
is a slight anti-correlation in the m— band, whereas M{s) 
rises slightly with decreasing scale when p—band luminosity 
is the mark, and the signal is even stronger when Lr is the 
mark. This trend with wavelength is qualitatively consistent 
with the predictions of semi-analytic galaxy formation mod¬ 
els (Sheth, Connolly & Skibba 2005) and indicates that the 
mean M-band luminosity of the galaxies in a halo depends 
less strongly on halo mass than does the mean r—band lu¬ 
minosity in a halo (Sheth 2005). In the models, the m— band 
luminosity is an indicator of the star formation rate; our 
measurements suggest that the correlation between star for¬ 
mation rate and halo mass is weak—if it is an increasing 
function of halo mass at low masses, then it decreases at 
larger masses. 

Figure0shows the result of weighting these same galax¬ 
ies by their colors. The top panel shows results where the 
weight is the difference in the absolute magnitudes. Mu —Mr 
and Mg — Mr, whereas the weights in the bottom bottom 
panel were the ratios of the luminosities in two bands. Com¬ 
parison of the two panels shows the effect on M{s) of rescal¬ 
ing the weights while preserving their rank-ordering—while 
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Figure 8. Redshift-space luminosity-weighted correlation func¬ 
tions measured in volume limited catalogs with Mr < —20.5 in 
the SDSS. Circles, triangles and squares show M(s) when the 
weight is r—, g— and u—band luminosity respectively. For clarity, 
jack-knife error bars are only shown for the r—band measurement, 
since the uncertainties are similar in the other bands. 

there are quantitative differences, the results in both panels 
are qualitatively similar. The M{s) measurements shown in 
the bottom panel are more widely separated because the lu¬ 
minosity ratio involves 10'^°^°'^, which has the effect of weight¬ 
ing the redder galaxies more heavily. In particular, this anal¬ 
ysis indicates clearly that close pairs of galaxies tend to be 
redder than average. Sheth, Connolly & Skibba (2005) show 
that this is also the case in semi-analytic galaxy formation 
models. 

The measurements shown in Figures and are con¬ 
sistent with models in which galaxies in clusters are more 
massive and have smaller star formation rates than galax¬ 
ies in the field. In effect, these figures demonstrate the en¬ 
vironmental dependence of galaxy luminosities and colors, 
without having to divide the galaxy sample up into discrete 
bins of ‘field’, ‘group’, and ‘cluster’. Thus, marked statis¬ 
tics allow one to study correlations with environment over a 
continuous range in density, rather than in somewhat arbi¬ 
trary discrete bins in environment. In this respect, our use 
of marked statistics to quantify and interpret environmental 
trends is very different from recent approaches which ad¬ 
dress the same problem. Since marked statistics are simple 
to measure and interpret, we hope that they will become 
standard tools for quantifying the correlation between the 
properties of galaxies and their environments. 
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